A Modular Approach to Spoken Language Translation for Large Domains

Authors

  • Monika Woszczyna
  • Matthew Broadhead
  • Donna Gates
  • Marsal Gavaldà
  • Alon Lavie
  • Lori S. Levin
  • Alexander H. Waibel
Abstract

The MT engine of the Janus speech-to-speech translation system is designed around four main principles: 1) an interlingua approach that allows the efficient addition of new languages, 2) the use of semantic grammars that yield low-cost, high-quality translations for limited domains, 3) modular grammars that support easy expansion into new domains, and 4) efficient integration of multiple grammars using multi-domain parse lattices and domain re-scoring. Within the framework of the C-STAR-II speech-to-speech translation effort, these principles are tested against the challenge of providing translation for a number of domains and language pairs with the additional restriction of a common interchange format.
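The fourth principle can be illustrated with a small sketch. The abstract does not describe the actual Janus data structures or scoring, so everything below is an assumption made for illustration: the `Edge` type, the `best_path` function, the `switch_penalty` parameter, and the domain names are hypothetical. The idea shown is only that several domain grammars each propose scored parses for word spans, and a re-scoring step that discourages switching domains picks one path covering the whole utterance.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Edge:
    """One parsed segment in a hypothetical multi-domain parse lattice."""
    start: int     # index of the first word covered
    end: int       # index one past the last word covered (assumed > start)
    domain: str    # sub-domain grammar that produced this parse
    score: float   # parser score for the segment (higher is better)


def best_path(edges, n_words, switch_penalty=1.0):
    """Return (total_score, edges) for the best full cover of words 0..n_words.

    Re-scoring here is a flat penalty applied whenever two consecutive
    segments come from different domain grammars. This is an illustrative
    assumption, not the scoring used in Janus.
    """
    # best[(position, domain)] = (score, path) of the best partial path
    # that ends at `position` with its last segment from `domain`.
    best = {(0, None): (0.0, [])}
    for pos in range(n_words):
        states = [(k, v) for k, v in best.items() if k[0] == pos]
        for (_, prev_domain), (score, path) in states:
            for e in edges:
                if e.start != pos:
                    continue
                switched = prev_domain is not None and prev_domain != e.domain
                penalty = switch_penalty if switched else 0.0
                cand = (score + e.score - penalty, path + [e])
                key = (e.end, e.domain)
                if key not in best or cand[0] > best[key][0]:
                    best[key] = cand
    finals = [v for k, v in best.items() if k[0] == n_words]
    return max(finals, key=lambda v: v[0]) if finals else None
```

With a non-zero `switch_penalty`, a path that stays inside one domain can beat a path whose individual segment scores are higher but which alternates between grammars. Setting the penalty to zero reduces the search to picking the highest-scoring segments independently of domain.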

Similar resources

Rapid Portability among Domains in an Interactive Spoken Language Translation System

Spoken Language Translation systems have usually been produced for such specific domains as health care or military use. Ideally, such systems would be easily portable to other domains in which translation is mission critical, such as emergency response or law enforcement. However, porting has in practice proven difficult. This paper will comment on the sources of this difficulty and briefly pr...

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers

In this paper, we describe a novel approach to spoken language analysis for translation, which uses a combination of grammar-based phrase-level parsing and automatic classification. The job of the analyzer is to produce a shallow semantic interlingua representation for spoken task-oriented utterances. The goal of our hybrid approach is to provide accurate real-time analyses while improving robu...

Analysis for Speech Translation Using Grammar-Based Parsing and Automatic Classification

In this paper, I describe a novel approach to analysis for spoken language translation which uses a combination of phrase-level grammar-based parsing and automatic classification. The job of the analyzer is to transform spoken task-oriented utterances into a shallow semantic interlingua representation. The goal of this hybrid approach is to provide accurate real-time analyses and to improve robu...

Lightly-Supervised Word Sense Translation Error Detection for an Interactive Conversational Spoken Language Translation System

Lexical ambiguity can lead to concept transfer failure in conversational spoken language translation (CSLT) systems. This paper presents a novel, classification-based approach to accurately detecting word sense translation errors (WSTEs) of ambiguous source words. The approach requires minimal human annotation effort, and can be easily scaled to new language pairs and domains, with only a wordal...

Synthetic Data for Neural Machine Translation of Spoken-Dialects

In this paper, we introduce a novel approach to generate synthetic data for training Neural Machine Translation systems. The proposed approach transforms a given parallel corpus between a written language and a target language to a parallel corpus between a spoken dialect variant and the target language. Our approach is language independent and can be used to generate data for any variant of th...

Publication date: 1998